OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Overview of the INEX 2012 Social Book Search Track

Internal identifier: 000327 (Main/Exploration); previous: 000326; next: 000328

Authors: Marijn Koolen [Netherlands]; Gabriella Kazai [United States]; Jaap Kamps [Netherlands]; Michael Preminger [Norway]; Antoine Doucet [France]; Monica Landoni [France, Switzerland]

RBID : Hal:hal-01071790

Abstract

The goal of the INEX 2012 Social Book Search Track is to evaluate approaches for supporting users in reading, searching, and navigating book metadata and full texts of digitised books as well as associated user-generated content. The investigation is focused around two tasks: 1) the Social Book Search task investigates the complex nature of relevance in book search and the role of user information and traditional and user-generated book metadata for retrieval, 2) the Prove It task evaluates focused retrieval approaches for searching pages in books that support or refute a given factual claim. There are two additional tasks that did not run this year. The Structure Extraction task tests automatic techniques for deriving structure from OCR and layout information, and the Active Reading Task aims to explore suitable user interfaces for eBooks enabling reading, annotation, review, and summary across multiple books. We report on the setup and the results of the two search tasks.

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Overview of the INEX 2012 Social Book Search Track</title>
<author>
<name sortKey="Koolen, Marijn" sort="Koolen, Marijn" uniqKey="Koolen M" first="Marijn" last="Koolen">Marijn Koolen</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-120654" status="VALID">
<orgName>University of Amsterdam [Amsterdam]</orgName>
<desc>
<address>
<addrLine>Spui 21 1012 WX Amsterdam</addrLine>
<country key="NL"></country>
</address>
<ref type="url">http://www.uva.nl/en/home</ref>
</desc>
<listRelation>
<relation active="#struct-303011" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-303011" type="direct">
<org type="institution" xml:id="struct-303011" status="VALID">
<orgName>University of Amsterdam</orgName>
<orgName type="acronym"> UvA</orgName>
<desc>
<address>
<country key="NL"></country>
</address>
<ref type="url">https://www.uva.nl/en/home</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Pays-Bas</country>
</affiliation>
</author>
<author>
<name sortKey="Kazai, Gabriella" sort="Kazai, Gabriella" uniqKey="Kazai G" first="Gabriella" last="Kazai">Gabriella Kazai</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-28609" status="VALID">
<orgName>Microsoft Research [Redmond]</orgName>
<desc>
<address>
<addrLine>One Microsoft Way, Redmond, WA 98052, USA</addrLine>
<country key="US"></country>
</address>
<ref type="url">http://research.microsoft.com/</ref>
</desc>
<listRelation>
<relation active="#struct-379481" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-379481" type="direct">
<org type="institution" xml:id="struct-379481" status="VALID">
<orgName>Microsoft Corporation [Redmond, Wash.]</orgName>
<desc>
<address>
<country key="US"></country>
</address>
<ref type="url">https://www.microsoft.com/fr-fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>États-Unis</country>
</affiliation>
</author>
<author>
<name sortKey="Kamps, Jaap" sort="Kamps, Jaap" uniqKey="Kamps J" first="Jaap" last="Kamps">Jaap Kamps</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-120654" status="VALID">
<orgName>University of Amsterdam [Amsterdam]</orgName>
<desc>
<address>
<addrLine>Spui 21 1012 WX Amsterdam</addrLine>
<country key="NL"></country>
</address>
<ref type="url">http://www.uva.nl/en/home</ref>
</desc>
<listRelation>
<relation active="#struct-303011" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-303011" type="direct">
<org type="institution" xml:id="struct-303011" status="VALID">
<orgName>University of Amsterdam</orgName>
<orgName type="acronym"> UvA</orgName>
<desc>
<address>
<country key="NL"></country>
</address>
<ref type="url">https://www.uva.nl/en/home</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Pays-Bas</country>
</affiliation>
</author>
<author>
<name sortKey="Preminger, Michael" sort="Preminger, Michael" uniqKey="Preminger M" first="Michael" last="Preminger">Michael Preminger</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-268127" status="INCOMING">
<orgName>Oslo and Akershus University College of Applied Sciences</orgName>
<desc>
<address>
<country key="NO"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-380120" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-380120" type="direct">
<org type="institution" xml:id="struct-380120" status="INCOMING">
<orgName>Oslo and Akershus University College of Applied Sciences</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Norvège</country>
</affiliation>
</author>
<author>
<name sortKey="Doucet, Antoine" sort="Doucet, Antoine" uniqKey="Doucet A" first="Antoine" last="Doucet">Antoine Doucet</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-388300" status="VALID">
<orgName>Equipe Hultech - Laboratoire GREYC - UMR6072</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-150" type="direct"></relation>
<relation name="UMR6072" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-300358" type="indirect"></relation>
<relation active="#struct-300266" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-150" type="direct">
<org type="laboratory" xml:id="struct-150" status="VALID">
<orgName>Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen</orgName>
<orgName type="acronym">GREYC</orgName>
<desc>
<address>
<addrLine>Boulevard du Maréchal Juin - 14050 CAEN Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.greyc.fr</ref>
</desc>
<listRelation>
<relation name="UMR6072" active="#struct-441569" type="direct"></relation>
<relation active="#struct-300358" type="direct"></relation>
<relation active="#struct-300266" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle name="UMR6072" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300358" type="indirect">
<org type="institution" xml:id="struct-300358" status="VALID">
<orgName>Ecole Nationale Supérieure d'Ingénieurs de Caen</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300266" type="indirect">
<org type="institution" xml:id="struct-300266" status="INCOMING">
<orgName>Université de Caen Basse-Normandie</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Caen</settlement>
<region type="region" nuts="2">Basse-Normandie</region>
</placeName>
<orgName type="university">Université de Caen Basse-Normandie</orgName>
</affiliation>
</author>
<author>
<name sortKey="Landoni, Monica" sort="Landoni, Monica" uniqKey="Landoni M" first="Monica" last="Landoni">Monica Landoni</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-93267" status="VALID">
<orgName>University of Lugano</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-305421" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-305421" type="direct">
<org type="institution" xml:id="struct-305421" status="INCOMING">
<orgName>University of Lugano</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Lugano</settlement>
<region nuts="3" type="region">Canton du Tessin</region>
</placeName>
<country>Suisse</country>
<orgName type="university">Université de la Suisse italienne</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-01071790</idno>
<idno type="halId">hal-01071790</idno>
<idno type="halUri">https://hal.archives-ouvertes.fr/hal-01071790</idno>
<idno type="url">https://hal.archives-ouvertes.fr/hal-01071790</idno>
<date when="2012">2012</date>
<idno type="wicri:Area/Hal/Corpus">000096</idno>
<idno type="wicri:Area/Hal/Curation">000096</idno>
<idno type="wicri:Area/Hal/Checkpoint">000074</idno>
<idno type="wicri:Area/Main/Merge">000331</idno>
<idno type="wicri:Area/Main/Curation">000327</idno>
<idno type="wicri:Area/Main/Exploration">000327</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Overview of the INEX 2012 Social Book Search Track</title>
<author>
<name sortKey="Koolen, Marijn" sort="Koolen, Marijn" uniqKey="Koolen M" first="Marijn" last="Koolen">Marijn Koolen</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-120654" status="VALID">
<orgName>University of Amsterdam [Amsterdam]</orgName>
<desc>
<address>
<addrLine>Spui 21 1012 WX Amsterdam</addrLine>
<country key="NL"></country>
</address>
<ref type="url">http://www.uva.nl/en/home</ref>
</desc>
<listRelation>
<relation active="#struct-303011" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-303011" type="direct">
<org type="institution" xml:id="struct-303011" status="VALID">
<orgName>University of Amsterdam</orgName>
<orgName type="acronym"> UvA</orgName>
<desc>
<address>
<country key="NL"></country>
</address>
<ref type="url">https://www.uva.nl/en/home</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Pays-Bas</country>
</affiliation>
</author>
<author>
<name sortKey="Kazai, Gabriella" sort="Kazai, Gabriella" uniqKey="Kazai G" first="Gabriella" last="Kazai">Gabriella Kazai</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-28609" status="VALID">
<orgName>Microsoft Research [Redmond]</orgName>
<desc>
<address>
<addrLine>One Microsoft Way, Redmond, WA 98052, USA</addrLine>
<country key="US"></country>
</address>
<ref type="url">http://research.microsoft.com/</ref>
</desc>
<listRelation>
<relation active="#struct-379481" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-379481" type="direct">
<org type="institution" xml:id="struct-379481" status="VALID">
<orgName>Microsoft Corporation [Redmond, Wash.]</orgName>
<desc>
<address>
<country key="US"></country>
</address>
<ref type="url">https://www.microsoft.com/fr-fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>États-Unis</country>
</affiliation>
</author>
<author>
<name sortKey="Kamps, Jaap" sort="Kamps, Jaap" uniqKey="Kamps J" first="Jaap" last="Kamps">Jaap Kamps</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-120654" status="VALID">
<orgName>University of Amsterdam [Amsterdam]</orgName>
<desc>
<address>
<addrLine>Spui 21 1012 WX Amsterdam</addrLine>
<country key="NL"></country>
</address>
<ref type="url">http://www.uva.nl/en/home</ref>
</desc>
<listRelation>
<relation active="#struct-303011" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-303011" type="direct">
<org type="institution" xml:id="struct-303011" status="VALID">
<orgName>University of Amsterdam</orgName>
<orgName type="acronym"> UvA</orgName>
<desc>
<address>
<country key="NL"></country>
</address>
<ref type="url">https://www.uva.nl/en/home</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Pays-Bas</country>
</affiliation>
</author>
<author>
<name sortKey="Preminger, Michael" sort="Preminger, Michael" uniqKey="Preminger M" first="Michael" last="Preminger">Michael Preminger</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-268127" status="INCOMING">
<orgName>Oslo and Akershus University College of Applied Sciences</orgName>
<desc>
<address>
<country key="NO"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-380120" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-380120" type="direct">
<org type="institution" xml:id="struct-380120" status="INCOMING">
<orgName>Oslo and Akershus University College of Applied Sciences</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Norvège</country>
</affiliation>
</author>
<author>
<name sortKey="Doucet, Antoine" sort="Doucet, Antoine" uniqKey="Doucet A" first="Antoine" last="Doucet">Antoine Doucet</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-388300" status="VALID">
<orgName>Equipe Hultech - Laboratoire GREYC - UMR6072</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-150" type="direct"></relation>
<relation name="UMR6072" active="#struct-441569" type="indirect"></relation>
<relation active="#struct-300358" type="indirect"></relation>
<relation active="#struct-300266" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-150" type="direct">
<org type="laboratory" xml:id="struct-150" status="VALID">
<orgName>Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen</orgName>
<orgName type="acronym">GREYC</orgName>
<desc>
<address>
<addrLine>Boulevard du Maréchal Juin - 14050 CAEN Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.greyc.fr</ref>
</desc>
<listRelation>
<relation name="UMR6072" active="#struct-441569" type="direct"></relation>
<relation active="#struct-300358" type="direct"></relation>
<relation active="#struct-300266" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle name="UMR6072" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="IdRef">02636817X</idno>
<idno type="ISNI">0000000122597504</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300358" type="indirect">
<org type="institution" xml:id="struct-300358" status="VALID">
<orgName>Ecole Nationale Supérieure d'Ingénieurs de Caen</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
<tutelle active="#struct-300266" type="indirect">
<org type="institution" xml:id="struct-300266" status="INCOMING">
<orgName>Université de Caen Basse-Normandie</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Caen</settlement>
<region type="region" nuts="2">Basse-Normandie</region>
</placeName>
<orgName type="university">Université de Caen Basse-Normandie</orgName>
</affiliation>
</author>
<author>
<name sortKey="Landoni, Monica" sort="Landoni, Monica" uniqKey="Landoni M" first="Monica" last="Landoni">Monica Landoni</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-93267" status="VALID">
<orgName>University of Lugano</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
<listRelation>
<relation active="#struct-305421" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-305421" type="direct">
<org type="institution" xml:id="struct-305421" status="INCOMING">
<orgName>University of Lugano</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Lugano</settlement>
<region nuts="3" type="region">Canton du Tessin</region>
</placeName>
<country>Suisse</country>
<orgName type="university">Université de la Suisse italienne</orgName>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The goal of the INEX 2012 Social Book Search Track is to evaluate approaches for supporting users in reading, searching, and navigating book metadata and full texts of digitised books as well as associated user-generated content. The investigation is focused around two tasks: 1) the Social Book Search task investigates the complex nature of relevance in book search and the role of user information and traditional and user-generated book metadata for retrieval, 2) the Prove It task evaluates focused retrieval approaches for searching pages in books that support or refute a given factual claim. There are two additional tasks that did not run this year. The Structure Extraction task tests automatic techniques for deriving structure from OCR and layout information, and the Active Reading Task aims to explore suitable user interfaces for eBooks enabling reading, annotation, review, and summary across multiple books. We report on the setup and the results of the two search tasks.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
<li>Norvège</li>
<li>Pays-Bas</li>
<li>Suisse</li>
<li>États-Unis</li>
</country>
<region>
<li>Basse-Normandie</li>
<li>Canton du Tessin</li>
</region>
<settlement>
<li>Caen</li>
<li>Lugano</li>
</settlement>
<orgName>
<li>Université de Caen Basse-Normandie</li>
<li>Université de la Suisse italienne</li>
</orgName>
</list>
<tree>
<country name="Pays-Bas">
<noRegion>
<name sortKey="Koolen, Marijn" sort="Koolen, Marijn" uniqKey="Koolen M" first="Marijn" last="Koolen">Marijn Koolen</name>
</noRegion>
<name sortKey="Kamps, Jaap" sort="Kamps, Jaap" uniqKey="Kamps J" first="Jaap" last="Kamps">Jaap Kamps</name>
</country>
<country name="États-Unis">
<noRegion>
<name sortKey="Kazai, Gabriella" sort="Kazai, Gabriella" uniqKey="Kazai G" first="Gabriella" last="Kazai">Gabriella Kazai</name>
</noRegion>
</country>
<country name="Norvège">
<noRegion>
<name sortKey="Preminger, Michael" sort="Preminger, Michael" uniqKey="Preminger M" first="Michael" last="Preminger">Michael Preminger</name>
</noRegion>
</country>
<country name="France">
<region name="Basse-Normandie">
<name sortKey="Doucet, Antoine" sort="Doucet, Antoine" uniqKey="Doucet A" first="Antoine" last="Doucet">Antoine Doucet</name>
</region>
<name sortKey="Landoni, Monica" sort="Landoni, Monica" uniqKey="Landoni M" first="Monica" last="Landoni">Monica Landoni</name>
</country>
</tree>
</affiliations>
</record>
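
As a rough illustration of how the `<tree>` element of the record above could be read programmatically, the following Python sketch groups author names by country. It is not part of Dilib: the function name `authors_by_country` and the trimmed `TREE_XML` fragment are assumptions made for this example. It works on the `<tree>` fragment only, because the full record uses the `wicri:` and `hal:` prefixes without visible namespace declarations, which a strict XML parser would likely reject.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Trimmed copy of part of the <tree> element from the record
# (attributes shortened; illustrative only, not the full data).
TREE_XML = """
<tree>
  <country name="Pays-Bas">
    <noRegion>
      <name sortKey="Koolen, Marijn">Marijn Koolen</name>
    </noRegion>
    <name sortKey="Kamps, Jaap">Jaap Kamps</name>
  </country>
  <country name="France">
    <region name="Basse-Normandie">
      <name sortKey="Doucet, Antoine">Antoine Doucet</name>
    </region>
    <name sortKey="Landoni, Monica">Monica Landoni</name>
  </country>
</tree>
"""


def authors_by_country(tree_xml):
    """Return {country name: [author names]} in document order."""
    root = ET.fromstring(tree_xml)
    grouped = defaultdict(list)
    for country in root.findall("country"):
        # iter("name") visits every descendant <name>, so authors nested
        # under <noRegion> or <region> are collected as well.
        for name in country.iter("name"):
            grouped[country.get("name")].append(name.text)
    return dict(grouped)


if __name__ == "__main__":
    for country, authors in authors_by_country(TREE_XML).items():
        print(country, "->", ", ".join(authors))
```

The country labels are kept exactly as they appear in the record (for example `Pays-Bas`), since other tools may rely on those values.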

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000327 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000327 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Hal:hal-01071790
   |texte=   Overview of the INEX 2012 Social Book Search Track
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024